SETS WITH SMALL SUMSET AND RECTIFICATION

Abstract. We study the extent to which sets $A \subseteq \mathbb{Z}/N\mathbb{Z}$, $N$ prime, resemble sets of
integers from the additive point of view ("up to Freiman isomorphism"). We give a
direct proof of a result of Freiman, namely that if $|A + A| \le K|A|$ and $|A| < c(K)N$ then
$A$ is Freiman isomorphic to a set of integers. Because we avoid appealing to Freiman's
structure theorem, we get a reasonable bound: we can take $c(K) \ge (32K)^{-12K^2}$.

As a byproduct of our argument we obtain a sharpening of the second author's result on
sets with small sumset in torsion groups. For example if $A \subseteq \mathbb{F}_2^n$, and if $|A + A| \le K|A|$,
then $A$ is contained in a coset of a subspace of size no more than $K^2 2^{2K^2 - 2}|A|$.



1. Introduction 

Additive questions involving integers are often simpler than the analogous questions for 
residues modulo a prime. For instance, the conjecture of Erdos and Heilbronn jE] that 
given $k$ residues modulo a prime $N$, the number of sums of distinct pairs is at least
$\min(2k - 3, N)$ was open for over 30 years until Dias da Silva and Hamidoune proved
it in [5], while the corresponding statement for integers is obvious.

By rectification we mean a result that allows us to replace a set of residues by a set of 
integers which behaves identically from our point of view. To illuminate this terminology 
we quote Freiman [7]: "It is known that sets of integers and sets of residues are different
if considered from an additive viewpoint: addition of sets of integers takes place on "the 
line" whereas addition of sets of residues is performed "on the circle" . However in the 
case of noniterated addition of sets of residues which are not too large, this circle is 
"broken up" . This phenomenon expresses itself by the fact that such a set of residues 
is isomorphic to the corresponding set of integers." 

To make this more exact, let $A$, $B$ be subsets of abelian groups, and let $f : A \to B$ be
1-1. We say that $f$ is a Freiman isomorphism of order $k$, or $F_k$-isomorphism for short,
if for $a_1, \dots, a_k, a'_1, \dots, a'_k \in A$ the equation

$$a_1 + \dots + a_k = a'_1 + \dots + a'_k$$

holds if and only if

$$f(a_1) + \dots + f(a_k) = f(a'_1) + \dots + f(a'_k).$$

This means that additive properties involving at most $k$ summands are preserved under
such a map. For the important case $k = 2$ we shall use the term Freiman isomorphism.

We say that a set of residues is rectifiable of order $k$ if it is $F_k$-isomorphic to a set of
integers (simply rectifiable, if $k = 2$).
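As an illustration of the definition, the following Python sketch tests the $F_k$ condition by brute force for the map that lifts residues to integers. The function name `is_freiman_isomorphism`, the prime 13 and the two sample sets are assumptions made for this example only; they do not come from the paper.

```python
from itertools import combinations_with_replacement

def is_freiman_isomorphism(f, sum_src, sum_dst, k=2):
    """Brute-force test of the F_k condition for a 1-1 map f given as a dict.

    sum_src / sum_dst add up a sequence of elements in the source / target group.
    """
    tuples = list(combinations_with_replacement(list(f), k))
    for s in tuples:
        for t in tuples:
            equal_before = sum_src(s) == sum_src(t)
            equal_after = sum_dst([f[x] for x in s]) == sum_dst([f[x] for x in t])
            if equal_before != equal_after:
                return False
    return True

N = 13  # a small prime, chosen only for illustration

def sum_mod_N(xs):
    return sum(xs) % N

def sum_int(xs):
    return sum(xs)

lift_short = {a: a for a in range(4)}  # residues 0..3 sent to integers 0..3
lift_long = {a: a for a in range(8)}   # residues 0..7: 7 + 7 wraps around mod 13

print(is_freiman_isomorphism(lift_short, sum_mod_N, sum_int))  # True
print(is_freiman_isomorphism(lift_long, sum_mod_N, sum_int))   # False
```

Since addition is commutative, it is enough to compare multisets of summands, which is why the sketch enumerates combinations with replacement rather than all ordered tuples.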

Suppose our set of residues modulo a prime $N$ is contained in $\{m, m + 1, \dots, m + l\}$.
The map which takes this interval into the interval $\{m, \dots, m + l\}$ of integers is an
$F_k$-isomorphism if $kl < N$, and so such a set is rectifiable of order $k$ if $l < N/k$. Since
multiplication by a nonzero residue is clearly a Freiman isomorphism of any order,
the same conclusion holds for a set contained in an arithmetic progression of length
$< N/k$. Using the box principle one can show that this happens whenever the set
has $< \log N/\log k$ elements, and we cannot say more if we only know the number of
elements.
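The criterion $kl < N$ can be turned into a naive search. The sketch below is only an illustration under our own conventions: the helper name `rectify` and the sample set are hypothetical, and the exhaustive loop over dilations $d$ and shifts $s$ is practical for small $N$ only. It looks for an affine image $dA + s$ that lies in $\{0, \dots, l\}$ with $kl < N$ and returns the corresponding integers.

```python
def rectify(A, N, k=2):
    """Search for d != 0 and a shift s with d*A + s inside {0, ..., l}, k*l < N.

    Returns (d, s, lifted), where lifted is the sorted integer image, or None
    if no affine image of A fits into such a short interval.
    """
    for d in range(1, N):
        for s in range(N):
            lifted = sorted((d * a + s) % N for a in A)
            if k * lifted[-1] < N:
                return d, s, lifted
    return None

# The progression {0, 5, 10, 15} mod 17 is long as an arc but short after dilation.
print(rectify([0, 5, 10, 15], 17))  # (3, 6, [0, 2, 4, 6])
```

A return value of `None` only means that this particular criterion fails; it does not by itself show that the set is not rectifiable.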

The situation changes if we also know that our set has few sums. We write 

$$A + B = \{a + b : a \in A,\ b \in B\},$$

$A - B$ is defined similarly, and we denote the $k$-fold iterated sum by

$$kA = A + \dots + A.$$
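For concreteness, these operations can be written in a few lines of Python. This is a minimal sketch; the function names and the optional modulus argument `N` are our own conventions, not notation from the paper.

```python
def sumset(A, B, N=None):
    """A + B in the integers, or in Z/NZ when a modulus N is given."""
    sums = {a + b for a in A for b in B}
    return {s % N for s in sums} if N else sums

def difference_set(A, B, N=None):
    """A - B in the integers, or in Z/NZ when a modulus N is given."""
    diffs = {a - b for a in A for b in B}
    return {d % N for d in diffs} if N else diffs

def iterated_sum(A, k, N=None):
    """The k-fold sum kA = A + ... + A (k summands, k >= 1)."""
    result = set(A)
    for _ in range(k - 1):
        result = sumset(result, A, N)
    return result

A = {0, 1, 2, 3}      # an arithmetic progression: few sums
B = {0, 1, 3, 7}      # all ten sums a + b with a <= b are distinct
print(len(sumset(A, A)), len(sumset(B, B)))   # 7 10
print(len(iterated_sum(A, 3)))                # 10
```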

Now Freiman [7, Ch. III, §4] proved that for any $K$ there is a constant $c(K) > 0$ with
the following property: if $A \subseteq \mathbb{Z}/N\mathbb{Z}$, if $|A + A| \le K|A|$ and if $|A| < c(K)N$, then
$A$ is rectifiable. A simpler proof can be found in [2]. Both proofs apply Freiman's
fundamental description of sets of integers with small doubling constant, that is to say
satisfying $|A + A| \le K|A|$ where $K$ is to be thought of as very small in comparison with
$|A|$. One formulation of this result asserts that such a set is contained in a generalized
arithmetic progression

$$P(a; q_1, \dots, q_d; l_1, \dots, l_d) = \Big\{a + \sum_{i=1}^{d} x_i q_i : 0 \le x_i \le l_i - 1\Big\},$$

where the dimension $d$ of $P$ is bounded by $d \le f_1(K)$ and its size $|P| = l_1 \cdots l_d$ (which
may be equal to its cardinality or may be larger) satisfies $|P| \le f_2(K)n$, where $n = |A|$. For different
versions and proofs of this result see Freiman [7] , Ruzsa [T2J HB] , Bilu [T] and Chang [H] ; 
the best estimates of $f_1$, $f_2$ are given in Chang's work.

The aim of this paper is to give a direct proof of the rectification theorem. 

Definition 1.1. The diameter $\operatorname{diam} A$ of a set $A$ (in $\mathbb{Z}$ or $\mathbb{Z}/m\mathbb{Z}$) is defined as the
smallest integer $l$ for which there exist some $a$, $d$ such that $A \subseteq \{a, a + d, \dots, a + ld\}$.
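For a prime modulus the diameter can be computed by brute force: dividing by the common difference $d$ turns a progression into an arc, so it suffices to find the shortest arc containing $d^{-1}A$ for each nonzero $d$. The sketch below follows this idea; the helper names are ours, and the inverse is computed with Fermat's little theorem, so the code assumes that $N$ is prime.

```python
def arc_length(A, N):
    """Smallest l such that A lies in {a, a+1, ..., a+l} mod N for some a."""
    pts = sorted({x % N for x in A})
    # Cyclic gaps between consecutive points; a single point has gap N.
    gaps = [(pts[(i + 1) % len(pts)] - pts[i] - 1) % N + 1 for i in range(len(pts))]
    return N - max(gaps)

def diameter(A, N):
    """diam A in Z/NZ for prime N, minimising over all common differences d."""
    best = N
    for d in range(1, N):
        d_inv = pow(d, N - 2, N)  # inverse of d mod N (N prime)
        best = min(best, arc_length([d_inv * x for x in A], N))
    return best

A = {0, 5, 10, 15}
print(arc_length(A, 17), diameter(A, 17))  # 12 3
```

The example shows why dilations matter: as an arc the set has length 12, but it is a progression with difference 5 and so has diameter 3.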

Here are our main results. In these statements, $\alpha \in (0, 1)$ and $K \ge 1$ are arbitrary,
whilst $k \ge 2$ is an integer.

Theorem 1.2. Let $N$ be a prime and let $A \subseteq \mathbb{Z}/N\mathbb{Z}$ be a set with $|A| = \alpha N$ and
$\min(|2A|, |A - A|) = K|A|$. Suppose that $\alpha \le (16K)^{-12K^2}$. Then the diameter of $A$ is
at most

$$12\alpha^{1/4K^2}\sqrt{\log(1/\alpha)}\,N. \tag{1.1}$$

Theorem 1.3. Let $N$ be a prime and let $A \subseteq \mathbb{Z}/N\mathbb{Z}$ be a set with $|A| = \alpha N$ and
$|2A| = K|A|$. Then $A$ is rectifiable of order $k$ provided that $\alpha \le (16kK)^{-12K^2}$.

Theorem 1.4. Let $A \subseteq \mathbb{Z}$ be a set with $|A| = n$ and $|2A| = Kn$. Then $A$ is
$F_k$-isomorphic to a set contained in the interval $[1, (16kK)^{12K^2} n]$.

As a byproduct, we will obtain an improved version of the analog of Freiman's theorem
in torsion groups (see Section 6).

In Section 2 we deduce a covering property for sets with small doubling and use it to estimate
iterated sums. In Section 3 we connect diameter and large Fourier coefficients. In Section 4 we deduce
the existence of large Fourier coefficients from the covering property. Section 5 contains the
synthesis.



2. Doubling, covering and the size of iterated sums 



The covering property of a set means that its sumset is covered by a few translations 
of the original set. We will show that if A has a small sumset, then its difference set 
possesses such a property, and this will be used to find bounds for the cardinality of its 
higher order sumsets.

Lemma 2.1. Let $A, B_1, B_2$ be finite sets in an arbitrary commutative group. Write
$|A| = n$, $|A + B_i| = K_i n$ ($i = 1, 2$). There is a set $T \subseteq B_1 + B_2$ such that $|T| \le 2K_1K_2 - 1$ and

$$B_1 - B_1 + B_2 - B_2 \subseteq A - A + T - T. \tag{2.1}$$



Proof. For the proof we will use the following Plünnecke-type inequality of the second
author which may be found in [H], Theorem 6.1: 

Lemma 2.2. Let $A, B_1, B_2$ be finite sets in an arbitrary commutative group. Write
$|A| = n$, $|A + B_i| = K_i n$ ($i = 1, 2$). There is a set $A' \subseteq A$ such that

$$|A' + B_1 + B_2| \le K_1K_2|A'|.$$



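Now take this set $A'$ and select elements $t_1, t_2, \dots, t_k$ of $B_1 + B_2$, as many as possible,
with the property that

$$\Big|(A' + t_j) \setminus \bigcup_{i=1}^{j-1}(A' + t_i)\Big| \ge |A'|/2.$$

This property immediately implies that

$$\Big|\bigcup_{i=1}^{k}(A' + t_i)\Big| \ge \frac{k + 1}{2}|A'|$$

(the first translate contributes $|A'|$ elements and each later one at least $|A'|/2$ new ones),
and since this set is contained in $A' + B_1 + B_2$, we can conclude that $k \le 2K_1K_2 - 1$.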

We set $T = \{t_1, \dots, t_k\}$ and we show that it has property (2.1). Take a generic element
of $B_1 - B_1 + B_2 - B_2$, say $u_1 - u_2$ with $u_j \in B_1 + B_2$. Since neither $u_j$ can serve as $t_{k+1}$,
we have

$$\Big|(A' + u_j) \setminus \bigcup_{i=1}^{k}(A' + t_i)\Big| < |A'|/2.$$

This means that there is an $a \in A'$ such that

$$a + u_j \in \bigcup_{i=1}^{k}(A' + t_i)$$

for $j = 1, 2$, say $a + u_j = a_j + t_{i_j}$. By subtracting these equations we obtain

$$u_1 - u_2 = a_1 - a_2 + t_{i_1} - t_{i_2} \in A' - A' + T - T \subseteq A - A + T - T$$

as required. □
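The greedy selection in this proof is easy to simulate for sets of integers. The sketch below is an illustration under stated assumptions rather than the construction of the lemma itself: it takes $A' = A$ instead of the subset supplied by Lemma 2.2 (so it checks the inclusion (2.1) but says nothing about the bound on $|T|$), it accepts a translate when it brings at least $|A'|/2$ new elements, and the names `greedy_T` and `four_fold` are ours.

```python
def translate(S, t):
    return {s + t for s in S}

def greedy_T(A_prime, B1, B2):
    """One pass over B1 + B2, keeping t whenever A' + t brings >= |A'|/2 new elements.

    A single pass is maximal: the covered set only grows, so a rejected
    candidate can never become acceptable later.
    """
    T, covered = [], set()
    for t in sorted({b1 + b2 for b1 in B1 for b2 in B2}):
        new = translate(A_prime, t) - covered
        if 2 * len(new) >= len(A_prime):
            T.append(t)
            covered |= new
    return T

def four_fold(X, Y):
    """X - X + Y - Y as a set of integers."""
    return {x1 - x2 + y1 - y2 for x1 in X for x2 in X for y1 in Y for y2 in Y}

A = {0, 1, 3, 7, 12}
T = greedy_T(A, A, A)          # A' = A and B1 = B2 = A: assumptions of this sketch
lhs = four_fold(A, A)          # B1 - B1 + B2 - B2
rhs = four_fold(A, set(T))     # A - A + T - T
print(len(T), lhs <= rhs)
```

The inclusion holds for any nonempty choice of $A' \subseteq A$; the role of Lemma 2.2 is only to guarantee that $T$ is small.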

Lemma 2.3. Let $A$ be a finite set in an arbitrary commutative group. Write $|A| = n$,
and assume that $\min(|2A|, |A - A|) \le Kn$. There is a set $T \subseteq 2A$ such that $|T| \le 2K^2 - 1$
and such that for every positive integer $m$ we have

$$(m + 1)(A - A) \subseteq A - A + m(T - T). \tag{2.2}$$

Proof. We obtain the case $m = 1$ by substituting $B_1 = B_2 = A$ or $B_1 = B_2 = -A$ into
(2.1). The general case follows by an obvious induction. □

We think that it is often better to work with such a covering property than with the 
small doubling assumption. (One reason is that the covering property is preserved under 
a linear mapping, whereas the small doubling may not be.) 

Definition 2.4. We say that a set $B$ (in a group) is $k$-covering if there is a set $T$ such
that $|T| = k$ and $B + B \subseteq B + (T - T)$ (whence $(m + 1)B \subseteq B + m(T - T)$ for every
positive integer $m$).

This particular form of the inclusion comes from our above results. A more complicated
inclusion, say $B + B \subseteq B + 100T - 200T$, would be almost as useful, since in estimating
the size of mB for large m it is more important to have a bound on the size of T than 
on the number of its occurrences. 

We define $J(k, m)$ as the number of $k$-tuples of integers $(x_1, \dots, x_k)$ satisfying

$$\sum_{i=1}^{k} x_i = 0, \qquad \sum_{i=1}^{k} |x_i| \le 2m. \tag{2.3}$$

Lemma 2.5. Let $B$ be a finite set in an arbitrary commutative group. Assume that $B$
is $k$-covering. For every positive integer $m$ we have

$$|(m + 1)B| \le |B|\,J(k, m). \tag{2.4}$$

Proof. Indeed, the inclusion $(m + 1)B \subseteq B + m(T - T)$ yields

$$|(m + 1)B| \le |B|\,|m(T - T)|.$$

If $T = \{t_1, \dots, t_k\}$, then every element of $m(T - T)$ can be written in the form $\sum x_i t_i$
with $x_i$ satisfying (2.3), hence

$$|m(T - T)| \le J(k, m). \qquad \square$$
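For small parameters the quantity $J(k, m)$ and the set $m(T - T)$ can be enumerated directly. The following sketch is a brute-force illustration only (the function names and the sample set $T = \{0, 1, 10\}$ are ours); it counts the tuples with $\sum x_i = 0$ and $\sum |x_i| \le 2m$ and compares the count with $|m(T - T)|$ for a set of integers.

```python
from itertools import product

def J(k, m):
    """Number of integer k-tuples with sum 0 and sum of absolute values <= 2m."""
    box = range(-2 * m, 2 * m + 1)
    return sum(1 for x in product(box, repeat=k)
               if sum(x) == 0 and sum(abs(v) for v in x) <= 2 * m)

def m_fold_differences(T, m):
    """The set m(T - T) for a finite set T of integers."""
    D = {s - t for s in T for t in T}
    result = {0}
    for _ in range(m):
        result = {x + d for x in result for d in D}
    return result

T = [0, 1, 10]
print(J(3, 2), len(m_fold_differences(T, 2)))   # 19 19
```

For this spread-out $T$ the bound of the lemma is attained; for a set with more additive relations, such as $\{0, 1, 2\}$, the set $m(T - T)$ is strictly smaller than $J(k, m)$.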

Lemma 2.6. For $m \ge k$ we have

$$J(k, m) \le (14m/k)^k. \tag{2.5}$$

Remark. For large $m/k$, $J(k, m)$ behaves essentially as $(4em/k)^k$, so at most the constant
14 can be improved.

Proof. Write $x_i = y_i s_i$ with $y_i = |x_i|$ and $s_i = \pm 1$. The number of possible choices for
the $s_i$ is $2^k$. The $y_i$'s satisfy $\sum y_i \le 2m$, and the number of such $k$-tuples is

$$\binom{2m + k}{k} \le \frac{(2m + k)^{2m+k}}{k^k (2m)^{2m}} \le (7m/k)^k.$$

To see the last inequality here observe that with the notation $k = tm$, $0 \le t \le 1$ it
reduces to

$$(2 + t)^{2+t} \le 4 \cdot 7^t.$$

This holds for $t = 0$ and $t = 1$, so it holds in between by the convexity of the function
$x \log x$. Thus we obtain

$$J(k, m) \le 2^k \binom{2m + k}{k} \le (14m/k)^k. \qquad \square$$
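The two elementary inequalities used in this proof can be sanity-checked numerically. The sketch below is not a proof and only samples a finite range; the function names and the ranges ($m \le 30$, a grid of step $0.01$ for $t$) are arbitrary choices of ours. The binomial checks are done in exact integer arithmetic by clearing the denominator $k^k$, and `math.comb` requires Python 3.8 or later.

```python
from math import comb

def first_bound_holds(k, m):
    """binom(2m+k, k) <= (7m/k)^k, checked with integers only."""
    return comb(2 * m + k, k) * k**k <= (7 * m) ** k

def lemma_bound_holds(k, m):
    """2^k * binom(2m+k, k) <= (14m/k)^k, again with integers only."""
    return 2**k * comb(2 * m + k, k) * k**k <= (14 * m) ** k

def scalar_inequality_holds(t):
    """(2 + t)^(2 + t) <= 4 * 7^t, the reduced form with k = t*m."""
    return (2 + t) ** (2 + t) <= 4 * 7**t

pairs = [(k, m) for m in range(1, 31) for k in range(1, m + 1)]
print(all(first_bound_holds(k, m) for k, m in pairs))
print(all(scalar_inequality_holds(i / 100) for i in range(101)))
```

Both lines print `True` on the sampled range, in agreement with the lemma.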

By combining the previous two lemmas we obtain the following one. 

Lemma 2.7. Let $B$ be a finite set in an arbitrary commutative group. Assume that $B$
is $k$-covering. For every positive integer $m \ge k$ we have

$$|(m + 1)B| \le (14m/k)^k |B|. \tag{2.6}$$

3. Large Fourier coefficients and diameter 

Our plan is to use the growth rate estimate of the previous section to find a large Fourier 
coefficient and to use it to estimate the diameter. This cannot be achieved directly, since 
the union of an interval and a few random numbers has a very large Fourier coefficient 
and a large diameter as well. We circumvent this difficulty as follows. We show that a 
large Fourier coefficient implies that a high percentage of our set is contained in a short 
progression, apply this to the difference set, and show that if a large part of A — A is in 
a progression, then the whole set A can be covered by a progression of the same length. 
This section contains the tools needed to carry through this project. 

Lemma 3.1. Let $A \subseteq \mathbb{Z}/N\mathbb{Z}$ be a set of size $n$. Assume that

$$|(A - A) \setminus [b, b + l]| < n/2$$

with some $b$ and some $l < N/3$. Then $A \subseteq [a, a + l]$ for some $a$.

Proof. As every quantity involved is invariant under shifts of $A$, we may assume that
$0 \in A$ (whence $A \subseteq A - A$) and that the longest gap in $A$ is of the form $[k, -1]$. We
have $|A \setminus [b, b + l]| < n/2$, so $A' = A \cap [b, b + l]$ satisfies $|A'| > n/2$. Now take an
arbitrary $a \in A$. The set $A' - a \subseteq A - A$ cannot be contained in $(A - A) \setminus [b, b + l]$, so
$a' - a \in [b, b + l]$ for some $a' \in A'$. Hence

$$a \in a' - [b, b + l] \subseteq [-l, l].$$

This holds for every $a$, so we have proved that $A \subseteq [-l, l]$.

This inclusion implies that there is a gap of length $(N - l) - l > l$. As we assumed
that the longest gap ends at 0, there can be no element of $A$ in $[-l, 0)$ and the above
inclusion can be strengthened to $A \subseteq [0, l]$. □

Now let $B \subseteq \mathbb{Z}/N\mathbb{Z}$. For $r \in \mathbb{Z}/N\mathbb{Z}$ we define the Fourier transform of $B$ at $r$ by

$$\hat{B}(r) = \sum_{b \in B} e(br/N)$$

where, as usual, $e(\theta) = e^{2\pi i\theta}$. Clearly $\hat{B}(0) = |B|$ and $\sup_{r \ne 0} |\hat{B}(r)| \le |B|$. We will be
interested in situations where equality almost holds in the latter bound.

Lemma 3.2. Let $B \subseteq \mathbb{Z}/N\mathbb{Z}$. Assume that $\varepsilon, \delta$ are real numbers such that $0 < \varepsilon < 1$,
$0 < \delta < 1/2$ and $|\hat{B}(1)| \ge (1 - 2\varepsilon(1 - \cos \pi\delta))|B|$. Then

$$|B \setminus [a, a + l]| < \varepsilon|B|$$

with suitable $a$ and $l < \delta N$.
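To see the phenomenon behind Lemma 3.2 in a small case, one can compute $\hat{B}(1)$ for a set packed into a short interval and for a set spread around the circle. The sketch below is purely illustrative: the modulus 101 and both sample sets are our own choices, and `fourier` simply evaluates the defining sum.

```python
import cmath

def fourier(B, r, N):
    """The Fourier transform of B at r: the sum of e(b*r/N) over b in B."""
    return sum(cmath.exp(2j * cmath.pi * b * r / N) for b in B)

N = 101
interval = list(range(10))                          # ten consecutive residues
scattered = [0, 7, 23, 41, 58, 64, 77, 85, 90, 99]  # ten spread-out residues

for name, B in [("interval", interval), ("scattered", scattered)]:
    # Ratio |B^(1)| / |B|: close to 1 only when B is concentrated on a short arc.
    print(name, round(abs(fourier(B, 1, N)) / len(B), 3))
```

The interval has a coefficient at $r = 1$ of nearly full size, while the scattered set does not; Lemma 3.2 is a quantitative converse of this observation.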

This is Lev Theorem 1] with some changes in the notation. A more comfortable 
sufficient assumption for the conclusion is

$$|\hat{B}(1)| \ge (1 - 8\varepsilon\delta^2)|B|; \tag{3.1}$$

this follows from Lev's sharp bound via the inequality

$$\cos \delta\pi \le 1 - 4\delta^2.$$
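The elementary inequality can be checked on a grid over the range $0 \le \delta \le 1/2$ relevant to Lemma 3.2. The sketch below is a numerical sanity check, not a proof; the grid step and the floating-point tolerance `1e-12` (needed because equality holds at both endpoints) are our choices.

```python
from math import cos, pi

def gap(delta):
    """(1 - 4*delta^2) - cos(pi*delta); nonnegative means the inequality holds."""
    return (1 - 4 * delta**2) - cos(pi * delta)

grid = [i / 1000 for i in range(0, 501)]   # delta from 0 to 1/2
print(min(gap(d) for d in grid) >= -1e-12)

# Consequence used in (3.1): the simpler hypothesis is the stronger one.
eps, delta = 0.3, 0.2
print(1 - 8 * eps * delta**2 >= 1 - 2 * eps * (1 - cos(pi * delta)))
```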

Lemma 3.3. Let $A \subseteq \mathbb{Z}/N\mathbb{Z}$. Write $D = A - A$, and suppose that

$$|\hat{D}(r)| \ge |D| - 4\delta^2|A|$$

for some $\delta \in (0, 1/3)$ and $r \ne 0$. Then $\operatorname{diam} A < \delta N$.

Proof. Write $|A| = n$ and $|D| = m$. Assume first that $r = 1$. We apply the previous
lemma (in the form (3.1)) with $D$ in the place of $B$ and $\varepsilon = n/2m$. We get an interval
of length $< \delta N$ that contains $D$ with less than $\varepsilon m = n/2$ exceptions. Now Lemma 3.1
implies that $A$ is contained in an interval of length $< \delta N$.

The general case follows by applying the special case to a suitable dilation of $A$. □

4. Finding a large Fourier coefficient 

With a future application in mind jH] we will describe the results here for general finite 
abelian groups. Let $G$ be a finite Abelian group and let $B \subseteq G$. The Fourier transform
$\hat{B}$ is defined for characters $\gamma \in G^*$ by

$$\hat{B}(\gamma) = \sum_{b \in B} \gamma(b).$$

We denote the principal character by $\gamma_0$.

Lemma 4.1. Let $G$ be a finite Abelian group with cardinality $N$, and suppose that
$B \subseteq G$ has size $\beta N$. Assume that $B$ is $k$-covering for some integer $k \ge 2$, and that
$\beta \le 14^{-k-1}$. Then there is a character $\gamma \ne \gamma_0$ such that $|\hat{B}(\gamma)| \ge (1 - \eta)|B|$, where

$$\eta = 18\beta^{1/k}\log(1/\beta)/k. \tag{4.1}$$

Proof. Let $m$ be a positive integer, and write $R = |(m + 1)B|$. We will calculate high
moments of $\hat{B}$. We have

$$\hat{B}(\gamma)^{m+1} = \sum_{x \in G} r_{m+1}(x)\gamma(x),$$

where $r_{m+1}(x)$ is the number of $(m + 1)$-tuples $(b_1, \dots, b_{m+1}) \in B^{m+1}$ such that
$b_1 + \dots + b_{m+1} = x$ (equivalently, $r_{m+1}$ is the $(m + 1)$-fold autoconvolution of the characteristic
function of $B$). We have

$$\sum_{x \in G} r_{m+1}(x) = \hat{B}(\gamma_0)^{m+1} = |B|^{m+1}$$

and by Cauchy-Schwarz we get

$$\sum_{x \in G} r_{m+1}(x)^2 \ge |B|^{2m+2}/R.$$

Now Parseval's identity yields

$$\sum_{\gamma \in G^*} |\hat{B}(\gamma)|^{2m+2} = N\sum_{x \in G} r_{m+1}(x)^2 \ge N|B|^{2m+2}/R.$$

On subtracting the contribution of $\hat{B}(\gamma_0) = |B|$ we obtain

$$\sum_{\gamma \ne \gamma_0} |\hat{B}(\gamma)|^{2m+2} \ge (N/R - 1)|B|^{2m+2}.$$

Since

$$\sum_{\gamma \in G^*} |\hat{B}(\gamma)|^2 = N|B|,$$

we can conclude that

$$\max_{\gamma \ne \gamma_0} |\hat{B}(\gamma)|^{2m} \ge (1/R - 1/N)|B|^{2m+1}.$$



A similar argument is used by Schoen [To] . 
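The moment computation can be replayed numerically in the cyclic group $\mathbb{Z}/N\mathbb{Z}$, where the characters are $x \mapsto e(rx/N)$. The following sketch uses a small example of our own choosing ($N = 31$, $B = \{0, 1, 2, 5\}$, $m = 2$); it checks Parseval's identity for the $(m + 1)$-fold autoconvolution and the resulting lower bound for the largest nontrivial coefficient.

```python
import cmath

def fourier(B, r, N):
    return sum(cmath.exp(2j * cmath.pi * b * r / N) for b in B)

def representation_counts(B, times, N):
    """r_times(x) for every x in Z/NZ, by repeated convolution with 1_B."""
    counts = [1] + [0] * (N - 1)          # zero summands: only x = 0
    for _ in range(times):
        new = [0] * N
        for x, c in enumerate(counts):
            for b in B:
                new[(x + b) % N] += c
        counts = new
    return counts

N, B, m = 31, [0, 1, 2, 5], 2
r = representation_counts(B, m + 1, N)
R = sum(1 for c in r if c > 0)            # R = |(m+1)B|
moment = sum(abs(fourier(B, g, N)) ** (2 * m + 2) for g in range(N))
largest = max(abs(fourier(B, g, N)) for g in range(1, N))

print(abs(moment - N * sum(c * c for c in r)) < 1e-6)                  # Parseval
print(largest ** (2 * m) >= (1 / R - 1 / N) * len(B) ** (2 * m + 1))   # final bound
```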

We use the estimate $R \le (14m/k)^k|B|$ from (2.6) (valid for $m \ge k$) and take $(2m)$-th
roots to obtain

$$\max_{\gamma \ne \gamma_0} |\hat{B}(\gamma)| \ge \big((k/14m)^k - \beta\big)^{1/2m}|B|. \tag{4.2}$$

This estimate is positive if $(k/14m)^k > \beta$, a condition which is satisfied by taking

$$m = \lfloor k(2\beta)^{-1/k}/14 \rfloor.$$

The assumption $\beta \le 14^{-k-1}$ implies that $m \ge k$. With this choice we have $(k/14m)^k \ge 2\beta$, so (4.2) gives

$$\max_{\gamma \ne \gamma_0} |\hat{B}(\gamma)| \ge \beta^{1/2m}|B|. \tag{4.3}$$

One easily sees that

$$m \ge k\beta^{-1/k}/36$$

so that

$$\beta^{1/2m} = e^{-\log(1/\beta)/2m} \ge 1 - \frac{\log(1/\beta)}{2m} \ge 1 - \eta.$$

Substituting this into (4.3) gives (4.1) immediately. □
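The choice of $m$ and the chain of inequalities at the end of the proof can be checked for sample values of $\beta$ and $k$. The sketch below is a floating-point illustration; the function names and the two sample pairs are ours, and `log` is the natural logarithm, as the use of $e^{-\log(1/\beta)/2m}$ in the proof indicates.

```python
from math import floor, log

def choose_m(beta, k):
    """m = floor(k * (2*beta)^(-1/k) / 14), the choice made in the proof."""
    return floor(k * (2 * beta) ** (-1 / k) / 14)

def eta(beta, k):
    """eta = 18 * beta^(1/k) * log(1/beta) / k, as in (4.1)."""
    return 18 * beta ** (1 / k) * log(1 / beta) / k

def check(beta, k):
    m = choose_m(beta, k)
    return (m >= k
            and (k / (14 * m)) ** k >= 2 * beta
            and 36 * m >= k * beta ** (-1 / k)
            and beta ** (1 / (2 * m)) >= 1 - eta(beta, k))

# Both samples satisfy beta <= 14^(-k-1).
print(check(14.0 ** -3, 2), check(1e-12, 3))
```

For $\beta$ at the threshold $14^{-k-1}$ the value of $\eta$ exceeds 1 and the conclusion is vacuous; the lemma only carries information when $\beta$ is considerably smaller.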

Lemma 4.2. Let $G$ be a finite Abelian group of size $N$. Let $A \subseteq G$ be a set for which
$\min(|2A|, |A - A|) \le K|A|$. Write $D = A - A$, and suppose that $|D| = \tau N$ where
$\tau \le 14^{-2K^2}$. Then there is a character $\gamma \ne \gamma_0$ such that $|\hat{D}(\gamma)| \ge (1 - \eta)|D|$, where

$$\eta = 9K^{-2}\tau^{1/2K^2}\log(1/\tau). \tag{4.4}$$

Proof. This follows from the previous lemma, using that $D$ is $k$-covering with some
$k \le 2K^2 - 1$. □

5. Assembling the pieces

In this section we prove Theorems 1.2, 1.3 and 1.4.

Proof of Theorem 1.2. Write $D = A - A$. We have $|D| \le K^2|A|$ by a result of the second
author [TO]. In the notation of Lemma 4.2 this means that $\tau \le K^2\alpha$. By Lemma 4.2
we know that there is an $r \ne 0$ such that $|\hat{D}(r)| \ge (1 - \eta)|D|$, the value of $\eta$ being
given by (4.4) (recall that $\tau = |D|/N$). By Lemma 3.3 we know that $\operatorname{diam} A < \delta N$ if
$\eta|D| \le 4\delta^2|A|$ and $\delta < 1/3$. The first of these conditions will be satisfied if $\delta = 2K\eta^{1/2}$.
The second will too, if we invoke the assumption $\alpha \le (16K)^{-12K^2}$ subject to which
our theorem is being proved (this is a somewhat tedious check, substituting for $\eta$ from
(4.4)). Another irritating calculation of an almost identical kind confirms that $\delta$ satisfies
the bound claimed in Theorem 1.2. □

Proof of Theorem 1.3. We only have to check that the assumption on $\alpha$ implies that
the $\delta$ in the proof of Theorem 1.2 satisfies $\delta < 1/k$. This is straightforward. □

Proof of Theorem 1.4. With no loss of generality we may assume that $A$ is one of the
shortest sets in its isomorphy class. In other words, $A \subseteq [1, L]$ and there is no
isomorphic set contained in $[1, L - 1]$. Take a prime $N \in (kL, 2kL)$. Let $A' \subseteq \mathbb{Z}/N\mathbb{Z}$
be the image of $A$; since $N > kL$, $A$ and $A'$ are $F_k$-isomorphic. Write $|A'| = \alpha N$. Let
$l$ be the diameter of $A'$; we already know that $l \le L$. By definition, this means that
there is an $A'' \subseteq \mathbb{Z}/N\mathbb{Z}$ which is homothetic (and a fortiori $F_k$-isomorphic) to $A'$ and
contained in $\{1, 2, \dots, l\}$. The corresponding set $A'''$ of integers is a set isomorphic to
$A$ and contained in $[1, l]$; by the minimality of $L$ we infer $l = L$. Comparing this with
the bound of Theorem 1.2 for $l$ we obtain

$$N < 2kL = 2kl \le 24k\alpha^{1/4K^2}\sqrt{\log(1/\alpha)}\,N.$$

This implies that $\alpha > (16kK)^{-12K^2}$, which means that $L \le (16kK)^{12K^2} n$. This is what
we wanted to prove. □

6. Sets with small sumset in a torsion group 

We recall that the exponent of a group $G$ is the smallest positive integer $r$ such that
$rg = 0$ for every $g \in G$.

Theorem 6.1. Let $G$ be a commutative group of exponent $r$, and assume that $A \subseteq G$.

(a) If $|2A| = K|A|$, then $A$ is contained in a coset of a subgroup of size

$$\le K^2 r^{\lfloor 2K^2 - 2 \rfloor}|A|.$$

(b) If $|A - A| = Kn$ with $n = |A|$, then $A$ is contained in a coset of a subgroup of size

$$\le K r^{\lfloor 2K^2 - 2 \rfloor}|A|.$$

The first result of this type is due to the second author [TJ] . In that paper the exponent 
of $r$ is $K^4$. A different approach was developed by Deshouillers, Hennecart and Plagne
[3]. In its present form, it works only for $r = 2$ and $K < 4$. For $K < 3$ their result is
superior to ours.

Proof. For a set $X \subseteq G$, we write $\operatorname{gen} X$ to denote the subgroup generated by $X$.
Our set $A$ is contained in a coset of $\operatorname{gen}(A - A)$, which is the union of the increasing
sequence $m(A - A)$. Hence (2.2) implies

$$\operatorname{gen}(A - A) \subseteq (A - A) + \operatorname{gen}(T - T)$$

with a suitable set $T$, $|T| = k \le 2K^2 - 1$. If $T = \{t_1, \dots, t_k\}$, then $\operatorname{gen}(T - T)$ is
generated by the $k - 1$ elements $t_i - t_1$, $i = 2, \dots, k$. So if every element has order $\le r$,
then

$$|\operatorname{gen}(T - T)| \le r^{k-1} \le r^{\lfloor 2K^2 - 2 \rfloor},$$

consequently

$$|\operatorname{gen}(A - A)| \le |A - A|\,|\operatorname{gen}(T - T)| \le |A - A|\, r^{\lfloor 2K^2 - 2 \rfloor}.$$

This proves part (b). To obtain part (a) we recall that if $|2A| = Kn$, then $|A - A| \le K^2 n$
by a result of the second author (THj. □ 
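Part (a) can be tried out in the group $\mathbb{F}_2^n$ (exponent $r = 2$), with vectors encoded as bit masks and addition as XOR. The sketch below is an illustration on one small example of our own choosing; the helper `span_f2` and the encoding are assumptions of the sketch, and exact rational arithmetic is used for $K$ so that the floor in the exponent is computed without rounding issues.

```python
from fractions import Fraction
from math import floor

def span_f2(generators):
    """Subgroup of F_2^n generated by the given bit masks (addition is XOR)."""
    subgroup = {0}
    for g in generators:
        subgroup |= {s ^ g for s in subgroup}
    return subgroup

A = [0b0001, 0b0010, 0b0100, 0b1000]          # the four unit vectors of F_2^4
two_A = {a ^ b for a in A for b in A}         # 2A; in F_2^n this equals A - A
K = Fraction(len(two_A), len(A))              # doubling constant, here 7/4
subgroup = span_f2([a ^ A[0] for a in A])     # gen(A - A)
bound = K**2 * 2 ** floor(2 * K**2 - 2) * len(A)
print(len(subgroup), bound)                   # 8 196
```

Here $A$ lies in a coset of a subgroup of size 8, well within the bound $K^2 2^{\lfloor 2K^2 - 2 \rfloor}|A| = 196$ given by the theorem.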